AIbase

# Efficient Quantization Deployment

Meta Llama 3.1 8B GGUF
A GGUF-quantized version of Meta-Llama-3.1-8B, produced with llama.cpp, for multilingual text generation.
Large Language Model, Supports Multiple Languages
fedric95
253
3
Meta Llama Llama 4 Scout 17B 16E Instruct Old GGUF
Other
Llama-4-Scout-17B-16E-Instruct is an instruction-tuned mixture-of-experts large language model released by Meta, with 17B active parameters and 16 experts. This version has been quantized to GGUF for more efficient inference.
Large Language Model, Supports Multiple Languages
bartowski
3,142
30
MiniCPM-o 2.6 GGUF
MiniCPM-o 2.6 is a multimodal model for vision and language tasks; this GGUF version is packaged for use with llama.cpp.
Image-to-Text
openbmb
5,660
101